Recombinant DNA; transformed microorganisms, plant cells and plants; a process for introducing
an inducible property in plants, and a process for producing a polypeptide or protein by means of
plants or plant cells.



This invention relates to recombinant DNA comprising vector-DNA and a DNA sequence corresponding with, or related to, a salicylate-inducible promoter of a GRP gene of plants, such as tobacco plants. The invention also relates to microorganisms, plant cells and plants transformed using the recombinant DNA, to a process for introducing an inducible property in plants and to a process for producing a polypeptide or protein, using plant cells and plants transformed using the recombinant DNA.
Recombinant DNA; transformed microorganisms, plant cells and plants; a process for introducing an
inducible property in plants, and a process for producing a polypeptide or protein by means of plants
or plant cells



A. Field of the invention 



The invention is in the field of recombinant DNA technology and is based on the identification of GRP (glycine-rich protein) genes occurring in plants with a salicylate-inducible promoter. More particularly, the invention relates to the use of such a salicylate-inducible promoter.



B. State of the art 

Plants are continuously subject to influences from their environment, which may involve a threat. These influences may relate to such factors as temperature, light, humidity, salt and injuries, but also attack by pathogens, such as viruses, fungi, bacteria, insects and the like. For its survival, the plant has available a broad range of defensive mechanisms which are activated when the plant is subject to such influences. This activation is generally accompanied by the induction of the expression of specific plant genes. This induction is controlled by control elements often present in the promoter region upstream of the gene in question. A given stress factor may either activate a highly specific set of plant genes, or result in a broad response of many defence genes. Thus an increase in temperature for a short period of time leads to the expression of so-called "heat shock" (HS) protein genes; the plant is subsequently resistant to temperatures to which untreated plants are not resistant. A conserved sequence of about 14 basepairs occurring several times in the promoter region of HS protein genes has been found to be responsible for the induction of these genes (see e.g. Pelham and Bienz, 1982; Bienz, 1985).

Various light-inducible genes have meanwhile been cloned. When a sequence of several hundreds of basepairs located upstream of these genes is fused with a "reporter" gene, for example, the chloramphenicol-acetyl-transferase gene (CAT gene), this gene becomes light-inducible in transgenic plants (see e.g. Kuhlemeier et al., 1987; Green et al., 1987; Stockhaus et al., 1987). In the promoter regions of a number of light-inducible genes, a common element of 9 basepairs can be distinguished, which is possibly involved in the light inducibility (Grob and Stüber, 1987).

When plants are injured, either mechanically or from being eaten by insects, plant genes are activated, inter alia, which code for proteinase inhibitors. These proteins, which are best characterized in tomato and potato, have virtually no effect on proteolytic enzymes of the plant but specifically inhibit digestive enzymes of animals, in particular those of insects. When a proteinase inhibitor gene of potato is induced by injury, it is found that, inter alia, base sequences are involved which are located downstream of the gene (Thornburg et al., 1987). When a proteinase inhibitor gene is placed under the control of a constitutive promoter (the CaMV-35S promoter) and expressed in transgenic plants, the plant is found to have become highly resistant to insect damage (Hilder et al., 1987).

To be able to defend itself against infection by pathogens, the plant has a mechanism known by the name of "hypersensitive response". When, as a result of infection, this mechanism becomes activated, the plant cells infected die, and a lignin wall is formed around the centre of infection, which the pathogen is unable to pass. This means that infection results in necrotic lesions at the centres of infection, but the other parts of the plant remain virtually free of pathogen. Pathogens not activating the hypersensitive response may spread throughout the entire plant and become accumulated to high concentrations.

It has been found that in the case of a necrotic infection the pathogen-free parts of the plant develop a resistance to a second infection by a broad range of pathogens, such as viruses, fungi and bacteria ("acquired resistance"), no matter what type of pathogen caused the first infection. Thus a necrotic virus infection leads to resistance to fungi and vice versa. Owing to the necrotic infection, a large number of genes are induced in the pathogen-free parts of the plant (for a survey, see: Van Loon, 1982; Van Loon, 1985; Collinge and Slusarenko, 1987; Bol and Van Kan, 1988; Bol, 1988; Van Loon, 1988). It is supposed that products coded for by the induced genes play a role in the acquired resistance. Part of the induced genes code for enzymes which, starting from the amino acid phenylalanine, synthesize a diversity of aromatic compounds. These include compounds inhibiting the growth of fungi and called phytoalexins. They also include precursors of the lignin used in reinforcing cell walls and forming a barrier around a centre of infection. Another part of the induced genes codes for hydroxyproline-rich glycoproteins (HRGP, extensin) which are incorporated in the cell wall and function as a matrix for attaching aromatic compounds, such as lignin. A third group of induced genes, finally, codes for
proteins which accumulate in the vacuole in the plant cell or are excreted in the intercellular space of the leaf. These so-called PR proteins (pathogenesis-related proteins) are best characterized in tobacco but occur in the plant kingdom in a highly conserved form. For one part they turn out to be hydrolytic enzymes, such as chitinases and glucanases, which in combination efficiently inhibit the growth of fungi on artificial media. Another PR protein is thought to inhibit the digestive enzymes of insects. The function of the other PR proteins is unknown.

White (1979) has found that the treatment of tobacco with certain aromatic compounds, such as salicylic acid (in the neutralized form), leads to the induction of a subgroup of PR proteins, i.e. the PR-1 proteins, and to a resistance to virus infection. This was seen as an indication that this subgroup of PR proteins is involved in the induced resistance to virus infection. Fraser (1983) argued against this that there are conditions which induce PR-1 proteins but do not generate an antiviral response. Hooft van Huijsduijnen et al. (1986) cloned DNA copies of six classes of messenger RNAs (mRNAs) which in Samsun NN tobacco are induced by tobacco mosaic virus (TMV) infection. Two of these classes of mRNAs are also induced by salicylate. One of these classes turned out to correspond to the PR-1 proteins. The other does not correspond to known PR proteins and was initially called "cluster C". Meanwhile the name has been changed into GRP-mRNA by reason of the discovery that it codes for a glycine-rich protein. This last suggests that the protein could be a cell wall component, comparable in function to the HRGP (Varner and Cassab, 1986). The copy DNAs (cDNAs) of the PR-1 mRNAs have been used as a probe for isolating clones of PR-1 genes from a genomic library of tobacco; the base sequence of these has been determined (Cornelissen et al., 1987).



C. Description of the invention 

By hybridizing GRP-cDNA with a Southern blot of DNA of Nicotiana tabacum cv. Samsun NN, it was found that the genome of tobacco contains about eight GRP genes. From a genomic library of Nicotiana tabacum cv. Samsun NN, four GRP genes were cloned. The base sequence of two of the cloned GRP genes was determined. Both were found to consist of two exons coding for a protein of 109 amino acids. After splitting off a putative signal peptide, about 25% of the mature protein consists of glycine and about 30% of charged amino acids. By S1-nuclease mapping, it was found that one of the two genes analysed is expressed. The sequence of this gene (clone gGRP-8) and the flanking DNA regions is given in Fig. 1. The other gene is probably not expressed in response to virus infection. From this, and from an analysis of the base sequence of cloned GRP-cDNAs, it can be concluded that at least three of the eight GRP genes are expressed after virus infection. The data obtained indicate that there is more than 80% homology between the coding sequences of the various GRP genes and also between the upstream DNA regions.
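The glycine and charged-residue percentages quoted above are simple composition figures. As an illustration only, the following Python sketch shows how such fractions could be computed from a one-letter protein sequence. The peptide used is a made-up placeholder, not the GRP sequence of Fig. 1, and the choice of Asp, Glu, Lys and Arg as the "charged" residues is an assumption, since the text does not state which residues were counted.

```python
from collections import Counter

# Assumption: "charged" means Asp (D), Glu (E), Lys (K) and Arg (R).
# The source does not say whether histidine was counted.
CHARGED_RESIDUES = frozenset("DEKR")


def residue_fraction(protein: str, residues) -> float:
    """Return the fraction of positions in `protein` occupied by `residues`."""
    protein = protein.upper()
    if not protein:
        raise ValueError("empty sequence")
    counts = Counter(protein)
    # Counter returns 0 for residues that do not occur in the sequence.
    return sum(counts[r] for r in residues) / len(protein)


# Hypothetical 10-residue peptide, NOT the GRP sequence of Fig. 1.
EXAMPLE_PEPTIDE = "GGRGKDEGAG"

glycine = residue_fraction(EXAMPLE_PEPTIDE, "G")
charged = residue_fraction(EXAMPLE_PEPTIDE, CHARGED_RESIDUES)
print(f"glycine: {glycine:.0%}, charged: {charged:.0%}")
```

Applied to a real mature protein sequence (signal peptide removed), the same two calls would give figures comparable to the percentages stated above.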

Fragments of the promoter region of the GRP gene in clone gGRP-8 were fused with the CAT reporter gene. By means of the Agrobacterium tumefaciens technology, these constructs were integrated into the genome of tobacco, and the transgenic plants were tested for inducibility of the CAT gene by salicylate. In a reproducible manner, it was found that the first 114 nucleotides upstream of the transcription initiation site contain one or more elements which cause the promoter to become inducible by salicylate. This promoter was found to be also induced by several other substances, including acrylic acid, ethylene, and ethephon. Between the nucleotides -400 and -645 of Fig. 1, there are one or more elements which greatly enhance the salicylate-inducible activity of the promoter. If, therefore, a DNA fragment carrying the sequence of nucleotides -645 to +8 is coupled to any given gene, then, after transformation of plants with this construct, it will be possible for the gene in question to be induced with salicylate and several other specific aromatics in a controlled manner. At this moment, no other plant promoters have been characterized which can be regulated with a chemical effector in such a simple manner.
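Promoter fragments are specified above by positions relative to the transcription initiation site. The following Python sketch illustrates one way such coordinates could be converted into a substring of a stored sequence. It assumes the common convention in which the initiation site is +1 and there is no position 0; the source does not state whether Fig. 1 follows that convention, and the function names and the toy sequence are hypothetical.

```python
def _linear(pos: int) -> int:
    """Map a position on a scale without 0 onto a gap-free integer scale."""
    if pos == 0:
        raise ValueError("position 0 does not exist in this numbering")
    return pos if pos < 0 else pos - 1


def fragment_length(start: int, end: int) -> int:
    """Number of nucleotides from `start` to `end`, both inclusive."""
    return _linear(end) - _linear(start) + 1


def extract_fragment(sequence: str, first_pos: int, start: int, end: int) -> str:
    """Return the bases from `start` to `end` (inclusive).

    `first_pos` is the position label of the first base of `sequence`.
    """
    lo = _linear(start) - _linear(first_pos)
    hi = _linear(end) - _linear(first_pos)
    if lo < 0 or hi >= len(sequence) or lo > hi:
        raise ValueError("requested fragment lies outside the sequence")
    return sequence[lo:hi + 1]


# Toy sequence whose first base is labelled -4 (positions -4..-1, +1..+4).
toy = "ACGTACGT"
print(extract_fragment(toy, -4, -2, 2))
print(fragment_length(-645, 8))
```

Under the stated assumption, a fragment running from -645 to +8 is 653 nucleotides long; if the figure instead used a numbering with a position 0, the result would differ by one.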



D. Further elaboration of the invention 

The invention provides broadly recombinant DNA comprising vector-DNA and a DNA sequence corresponding to, or related to, a salicylate-inducible promoter of a GRP gene of plants.

The vector-DNA portion of the recombinant DNA according to the invention is not critical per se, and is determined by the contemplated use of the recombinant DNA, in particular the host to be transformed. Those skilled in the art know what vectors are suitable for given hosts. Known vectors which can be used in the Agrobacterium tumefaciens technology for the transformation of plants and plant cells are, for example, pAGS127 (van den Elzen et al., 1985) and pROK1 (Baulcombe et al., 1988). Known vectors which can be used for cloning in bacteria, such as Escherichia coli, are, for example, the various pUC plasmids.

As is well known to those skilled in the art, the vector DNA will commonly, in addition to an origin of replication that is suitable to the host, also con-



The novel and inventive element in the recombinant DNA according to this invention consists in the DNA sequence which corresponds with, or is related to, a salicylate-inducible promoter of a GRP gene of plants. Fig. 1 illustrates one concrete example of such a GRP gene, comprising a structural GRP gene and flanking regulation sequences. In nature, however, variants occur which are comprised by the present invention as far as they contain a salicylate-inducible promoter. The same applies to artificially constructed variants not demonstrated to be naturally occurring: these too are comprised by the present invention, provided they contain a salicylate-inducible promoter. Of the flanking DNA sequences, only certain portions are responsible for the promoter function, the inducibility of the promoter by salicylate, and the strength of either the promoter or its inducibility by salicylate. Particularly in the other portions of the flanking regions, considerable variations are permissible. As regards the nucleotide sequence of the possible structural gene placed under the control of the promoter sequence, changes which do not affect the eventual sequence of amino acids will often be permissible. Changes leading to minor deviations in the sequence of amino acids will in many cases still be without consequences for the expression and function of the protein. The place, length and nucleotide sequence of introns can generally be varied as well, provided they can be processed by the host.

It should be noted that the term "GRP gene", as used herein, means not only the DNA coding for GRP, but, in a broader sense, the DNA involved in the expression of GRP, including the DNA coding for GRP (designated herein as structural GRP gene) and flanking DNA regions with regulating functions, including the GRP promoter.

Preferred embodiments of the invention de- 
scribed herein consist in the use of the GRP pro- 
moter for the following purposes. 



For the production through recombinant DNA techniques of proteins that have to undergo a post-translational modification, e.g., glycosylation, it is recommendable to use eukaryotic organisms. It is to be expected that, for this production, in addition to yeast and animal cells, plants can be used in future. By means of the GRP promoter, the production of the desired protein can be switched on at a controlled point of time by spraying or watering the plants with a solution containing millimolar quantities of sodium salicylate. This is in particular of importance when the protein to be produced is toxic to the plant or, for example, owing to a one-sided amino acid composition, forms a burden for the plant's metabolism. The salicylate can also be supplied through the ground water, when a local effect only is considered undesirable. In addition, in that case a separate step for rinsing off the salicylate, which when dried may induce necrosis on the leaves, can be dispensed with. When the GRP promoter or derivatives thereof are fused with the coding sequence for a suitable signal peptide, there is the possibility of causing the desired protein to be secreted by the plant in the intercellular space of the leaf, from which it can be isolated in a simple manner in relatively pure form.



2. Controlled expression of genes in plants 

Another possibility is the expression of genes to be controlled from the outside, with the object of controlling certain processes in the plant which, for example, are of interest for agricultural use. Thus genes involved in disease resistance could be expressed in a controlled manner. Also, this promoter, in combination with suitable genes involved in disease resistance, will react both rapidly and with great effectiveness in response to infection by a large group of pathogens, resulting in a more effective resistance reaction. This is the case because the original GRP gene, for example after infection of tobacco with TMV, is one of the fastest and most efficiently reacting genes. The genes in question, controlled by the GRP promoter, may originate from the plant itself, or have been introduced from the outside and originate either from other plants or from other organisms (after being rendered suitable for expression and functioning of the gene product in the plant).

3. The controlled production of commercially interesting proteins in plant cell cultures

Various biotechnologically oriented firms and institutions are at present investigating the possibility of utilizing large-scale cultures of genetically engineered plant cells for the production of proteins or secondary metabolites. In principle, there is the possibility of bringing the expression of an economically interesting gene under the control of the GRP promoter or derivatives thereof. Through standard techniques, cell cultures or root cultures can be obtained from plant material transformed by the Agrobacterium tumefaciens technology with the promoter/gene fusion construct in question. In such cell cultures, the gene concerned can be induced at the desired moment by adding sodium salicylate to the culture medium in millimolar quantities.
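To give a feel for what "millimolar quantities" means in practice, the following Python sketch converts a target concentration and a volume into a mass of sodium salicylate. It is an arithmetic illustration only: the molar mass of about 160.10 g/mol for anhydrous sodium salicylate (C7H5NaO3) is a standard value not taken from this text, and the 1 mM and 1 litre figures are example inputs rather than a prescribed protocol.

```python
# Molar mass of anhydrous sodium salicylate, C7H5NaO3, in g/mol.
# Assumption: standard atomic weights; not stated in the source text.
SODIUM_SALICYLATE_G_PER_MOL = 160.10


def grams_needed(concentration_mM: float, volume_litres: float) -> float:
    """Mass of sodium salicylate (g) for the given concentration and volume."""
    if concentration_mM < 0 or volume_litres < 0:
        raise ValueError("concentration and volume must be non-negative")
    moles = concentration_mM / 1000.0 * volume_litres
    return moles * SODIUM_SALICYLATE_G_PER_MOL


# Example inputs only: 1 mM in 1 litre of medium or spray solution.
print(f"{grams_needed(1.0, 1.0):.4f} g")
```

On these assumptions, one litre of a 1 mM solution requires roughly 0.16 g of the salt, and the amount scales linearly with both concentration and volume.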



I. Cloning of GRP-cDNA

Polyadenylated RNA was isolated from tobacco mosaic virus infected tobacco and enriched through gradient centrifugation in molecules of 650 nucleotides (Hooft van Huijsduijnen et al., 1986). Using standard techniques, well known to researchers in this field, the RNA could be copied by means of an oligo(dT) primer, reverse transcriptase and deoxyribonucleotide triphosphates into minus-strand DNA. Subsequently, using RNase H and DNA polymerase, a complementary DNA chain was synthesized on this DNA by the method of Gubler and Hoffman (1983). The double-stranded DNA was provided with C tails, which were hybridized with G tails, formed on the plasmid pUC9 after this had been cleaved with PstI (Maniatis et al., 1982). This construct was used for the transformation of E. coli MH-1. The transformants were streaked in duplicate on nitrocellulose filters. One filter was hybridized with cDNA of poly(A)-RNA from TMV-infected tobacco, transcribed in vitro; the other filter was hybridized with cDNA against poly(A)-RNA from healthy tobacco (Maniatis et al., 1982). Transformants hybridizing better with the first probe than with the second contained cDNA of mRNAs induced by TMV infection. From these transformants, plasmid was isolated, the insert was subcloned in M13 vectors and the sequence of the insert was determined by the method of Sanger et al. (1977). Clones with sequences homologous to the sequence of nucleotides given in Fig. 1 contain the GRP-cDNA. As an alternative to the differential hybridization method, the cDNA library can be searched with a probe consisting of a deoxyoligonucleotide synthesized on the basis of the sequence of the GRP exons given in Fig. 1.

II. Cloning of GRP genes

DNA isolated from the nuclei of Samsun NN tobacco was partially digested with Sau3A I and cloned in the vector Charon 35 (for references, see: Cornelissen et al., 1987). The genomic library was searched with the plaque hybridization technique of Benton and Davis (1977), using the cDNA isolated in Example I as a probe. The insert in positively hybridizing phages contained the GRP gene and could be subcloned in pUC9 plasmids in parts through standard techniques.

III. Determination of GRP promoter activity

The construction of GRP promoter/CAT gene fusions is illustrated diagrammatically in Fig. 2. A HindIII fragment of gGRP-8 containing the sequence of nucleotides -645 to +155 was subcloned. From position +155, deletions were made with Bal31, whereafter the ends were provided with ClaI linkers by standard techniques. HindIII-ClaI fragments were subcloned in the vector pUGC and characterized by means of sequence analysis. One deletion mutant (pDEL+8) turned out to contain the sequence of from -645 to +8 and accordingly lacks the ATG initiation codon of the GRP gene.

The polyadenylation signal of the nopaline synthase gene (Tnos) was isolated from plasmid pDH52 (Van Dun et al., 1987) as a 2 kb BamHI fragment and cloned in pIC19H (Marsh et al., 1984), which yielded pIC19H-Tnos. From this plasmid, Tnos can be cut as a 260 bp EcoRI fragment and subcloned in pUC8, which produced pUC8-Tnos. This 260 bp Tnos fragment was also subcloned in pDEL+8 downstream of the GRP promoter. Finally, the CAT gene of transposon Tn9 (Alton and Vapnek, 1979) was isolated as a 773 bp TaqI fragment from the pCaMV-CAT plasmid (Fromm et al., 1985) and cloned in the ClaI site of construct pPR645, which produced plasmid pPRC645.

Fragments of pDEL+8 were subcloned in pUC8-Tnos as blunt-ClaI fragments. Subsequently, the 773 bp TaqI fragment with the CAT gene was integrated in these constructs. The promoter fragments of pDEL+8 fused with the CAT gene by this route were cut at the 3' site with ClaI at position +8 and at the 5' site with the enzymes EcoRV (at position -400), HaeIII (at position -135) or AvaII (at position -114), respectively. The corresponding plasmids were called pPRC400, pPRC135 and pPRC114. These three plasmids and the pPRC645 plasmid were linearized with HindIII and cloned in the HindIII site of the binary transformation vector pAGS127. The CaMV-CAT plasmid was cloned as an XbaI fragment in pAGS127. The resulting constructs were transferred to Agrobacterium tumefaciens strain LBA4404 (Ooms et al., 1982), and the transconjugants were used to transform Samsun NN with the leaf disc method by standard procedures. Transgenic plants, regenerated from shootlets, were tested by punching discs from the leaf and causing these to float on water or a solution of 1 mM salicylic acid for 24 h. Protein
extracts of these discs were tested for CAT activity according to Gorman et al. (1982). Fig. 3 shows the results. In lanes 1 and 2, 400 µl protein was used, in the other lanes 100 µl. In lanes W and S, protein was used from discs floated respectively on water and salicylic acid. In lane 3, protein was used which had been isolated immediately after punching leaf discs. Lanes 6 up to and including 11 show that GRP promoter sequences of 400, 135 and 114 bp give the same degree of salicylic acid inducible CAT activity. Although relatively low, this activity is significant, as can be seen after magnifying the signal in lanes 1 and 2. The construct with the 645 bp promoter region gives a much higher activity. The CAT activity in the leaf discs floated on water (lane 4) has probably been induced through wounding the leaf during punching. Here again, the CAT activity is considerably stimulated by salicylic acid. The conclusion can be drawn that elements responsible for the salicylic acid inducibility are present in the region between nucleotides -114 and +8, while one or more enhancer elements are present between nucleotides -645 and -400.
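The reasoning behind this conclusion is a standard 5' deletion analysis, and it can be sketched in a few lines of Python. The construct names and end points below are those given in the text; the "high"/"low" labels are a qualitative reading of the description of Fig. 3, not measured values, and the class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Construct:
    name: str
    five_prime: int         # 5' end of the promoter fragment
    three_prime: int        # 3' end (ClaI linker at +8)
    inducible: bool         # CAT activity inducible by salicylic acid
    relative_activity: str  # qualitative label: "high" or "low"


# Qualitative summary of the results described for Fig. 3.
CONSTRUCTS = [
    Construct("pPRC645", -645, 8, True, "high"),
    Construct("pPRC400", -400, 8, True, "low"),
    Construct("pPRC135", -135, 8, True, "low"),
    Construct("pPRC114", -114, 8, True, "low"),
]


def minimal_inducible_region(constructs):
    """Shortest fragment that is still inducible, as (5' end, 3' end)."""
    inducible = [c for c in constructs if c.inducible]
    shortest = max(inducible, key=lambda c: c.five_prime)
    return (shortest.five_prime, shortest.three_prime)


def enhancer_interval(constructs):
    """Interval whose removal drops activity from "high" to "low"."""
    ordered = sorted(constructs, key=lambda c: c.five_prime)  # longest first
    for longer, shorter in zip(ordered, ordered[1:]):
        if longer.relative_activity == "high" and shorter.relative_activity == "low":
            return (longer.five_prime, shorter.five_prime)
    return None


print(minimal_inducible_region(CONSTRUCTS))
print(enhancer_interval(CONSTRUCTS))
```

With these inputs the sketch returns the region -114 to +8 for inducibility and the interval -645 to -400 for the enhancing element or elements, matching the conclusion drawn above.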
1. Recombinant DNA comprising vector-DNA and a DNA sequence corresponding with, or related to, a salicylate-inducible promoter of a GRP gene of plants.

2. Recombinant DNA as claimed in claim 1, comprising vector-DNA and a salicylate-inducible promoter of a GRP gene of tobacco.

3. Recombinant DNA as claimed in claim 1, comprising vector-DNA and a salicylate-inducible promoter of a GRP gene of Nicotiana tabacum cv. Samsun NN.

4. Recombinant DNA as claimed in claim 1, comprising vector-DNA and the DNA sequence of nucleotide -645 to nucleotide +8 of the GRP gene in clone gGRP-8, or a variant or portion thereof having a salicylate-inducible promoter activity.

5. Recombinant DNA as claimed in any of claims 1-4, comprising a structural gene different from the structural GRP gene under the control of the GRP promoter.

6. Microorganisms, plant cells and plants transformed using recombinant DNA as claimed in any of claims 1-5, and progeny thereof which still contain the promoter sequence introduced.

7. A process for producing a polypeptide or protein by culturing plants or plant cells capable of synthesizing the desired polypeptide or protein, and isolating the polypeptide or protein produced, which comprises using plants which have been transformed, using recombinant DNA as claimed in claim 5 with a GRP-promoter-controlled structural gene therein which codes for the desired polypeptide or protein, and inducing the production of the polypeptide or protein by contacting the plants or plant cells with salicylate or another agent inducing the GRP promoter.
FIG. 1A, FIG. 1B and FIG. 1C. Nucleotide sequence of the GRP gene in clone gGRP-8 and its flanking DNA regions, with position labels running from about -1760 to about +1240. The deduced amino acid sequence of the two exons is shown above the coding sequence; the intron and the end of the cDNA are marked.